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(54) Dual storage controllers 

(57) A primary controller (104) operates to transmit 
write data and a write time to a secondary controller 
(109) in the earlier sequence of the write times after 
reporting a completion of a request for write to a 
processing unit (100). The secondary controller (109) 
stores the write data and the write time transmitted from 
the primary controller (104) in the cache memory (108). 
At a time, the secondary controller (1 09) stores the write 
data in a disk unit (105) in the earlier sequence of the 
write time. These operations make it possible to guaran- 
tee ail the write data on or before the reference time. 
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Description 

BACKGROUND OF THE INVENTION 

[0001 ] The present invention relates to dual writing of 
data to be executed through the effect of two controllers. 
In particular, the present invention is effective in the 
case that a long distance exists between the two con- 
trollers so that a delay takes place in transferring data 
between the controllers. 

[0002] The following techniques have been disclosed 
as the prior art of the present invention. 
[0003] The European Patent Publication No. 
0671686A1 has disclosed a technique of doing dual 
writing through the effect of controllers spaced from 
each other by a long distance. In this technique, one 
controller guarantees data on a disk therein rf the other 
controller is broken by a disaster such as an earth- 
quake. The technique disclosed in EP-0671686A1 is 
arranged so that a primary controller directly receives 
write data from a host computer, transfers the received 
write data to a secondary controller located in a remote 
place, and reports completion of receipt of the write 
data to a host computer. This is a quite excellent 
method from a view-point of data security because the 
data to be stored in the primary controller is completely 
equivalent to the data to be stored in the secondary con- 
troller. However, a longer distance existing between two 
controllers makes a data transfer time between the con- 
trollers far larger. This method therefore has difficulty on 
performance to be solved if one controller is far away 
from the other controller. 

[0004] The European Patent Application Publication 
No. 0672985A1 also has disclosed a technique of doing 
dual writing on disks loaded in two controllers far away 
from each other. The technique disclosed in EP- 
0672985A1 is arranged so that a primary controller 
directly receives write data from a primary host compu- 
ter, immediately after receipt of the write data, reports 
completion of the receipt of the write data to the primary 
host computer. In the EP-0672985A1 , a copy of the 
write data received by the primary controller is read out 
to the primary host computer. According to the inven- 
tion, a time stamp is given to the write data received 
from the primary host computer for the first time. The 
time stamp indicates a time when a request for writing 
the write data is issued. When the copy of the write data 
is read out to the primary host computer, the write time 
is passed to the primary host computer as well. Then, 
the primary host computer sends the copy of the write 
data and the write time to a secondary host computer. 
[0005] When the secondary host computer receives 
the write data and the write time, information such as 
the write data is written on the disk for the control pur- 
pose. Further, the write data is written on the disk on the 
secondary side in the sequence of the write time by 
referring to the time given to each write data record. 
[0006] In the EP-06729851A1, the secondary host 



computer performs the foregoing procedure, because it 
is not desirable to leave the intermediate results of the 
transactions normally used by an online system. For 
example, considering a transaction of transferring a 

5 bank deposit from a bank account A to another bank 
account B, though the deposit is withdrawn from the 
bank account A, the state of having transferred no cor- 
responding deposit to the bank account B is not left. 
This means that no intermediate result of the transac- 

;o tion is left. Normally, the recovery unit is a transaction 
on the online system. Hence, leaving the intermediate 
result of the transaction is a quite significant obstacle. 
[0007] In turn, why no intermediate result of the trans- 
action can be left by the foregoing process will be briefly 

15 described below. The disks on which data is dually writ- 
ten contain a disk for storing a database such as 
account information and a disk for storing a journal 
where an update history of the transaction is left. If the 
host computer is failed, a recovery program is started to 

20 analyze the journal so that the update result of the 
unfinished transaction is returned to the state before the 
execution and no intermediate result of the transaction 
is left. The write data written on the disk loaded in the 
secondary controller is effective only in the cases such 

25 as when the primary controller for storing the latest write 
data is broken. The secondary controller does not store 
the latest write data but can guarantee the write data up 
to a certain time. Hence, apparently, the host computer 
yields an equivalent state to the failed state at a time 

30 when the write data is guaranteed. As a result, by using 
the disk for storing the journal, loaded in the secondary 
controller and the disk for storing the database, the sim- 
ilar process to the recovery to be executed when the 
host computer is failed is executed so that the interme- 

35 diate result of the transaction may not be left. 

[0008] Further, the disk controller contains a non-vol- 
atile cache memory and provides a write-after tech- 
nique, that is, a technique of writing the write data 
receive^ from the host computer onto a non-volatile 

40 cache memory and reporting the completion of the write 
to the host computer. The non-volatile cache memory is 
highly reliable, so that the data may be sufficiently guar- 
anteed by storing the write data in the cache. 
[0009] The technique disclosed in the EP-0672985A1 

45 is arranged to suppress the degrade of the performance 
though some data is lost even if a distance between the 
controllers is expanded. Further, it does not leave the 
intermediate result of the transaction. 



[0010] However, the technique disclosed in the EP- 
0672985A1 is ananged so that the primary host compu- 
ter reads out data and transfers the data to the second- 
55 ary host computer. Hence, unlike the EP-0671686A1 
ananged to directly transfer the write data between the 
controllers, this technique needs one more data trans- 
fers. Further, an I/O process to and from a storage 
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medium such as an MT is required to be done. 
[0011] It is an object of the present invention to pro- 
vide a technique which provides functions of directly 
transferring write data between controllers like the EP- 
0672985A1, suppressing degrade of the performance s 
to a minimum even if the distance between the control- 
lers is made longer, and leaving no intermediate result 
of a transaction. Further, no execution of the I/O proc- 
ess of control information to and from disks is required 
for enhancing the performance. 
[0012] Later, the description will be oriented to how 
the present invention realizes the aforementioned 
object. 

[0013] According to the present invention, the host 
computer gives a write time to each write data record 
when it issues a request for write to a primary controller. 
When the primary controller receives the write data 
from the host computer, it reports the completion of the 
receipt to the host computer. Then, the primary control- 
ler sends the write data records and the write times to 
the secondary controller. At this time, the primary con- 
troller operates to send the write data records to the 
secondary controller in the sequence of the writing 
times. These series of operations make it possible to 
suppress the degrade of the performance to a minimum 
even though the distance between the controllers is 
made longer. 

[001 4] The secondary controller operates to store the 
write data received from the primary controller onto a 
non-volatile cache memory. This makes it possible to 
guarantee the write data without any I/O process of con- 
trol information to and from disks. 
[0015] The secondary controller can guarantee the 
write data up to a certain time by referring to the 
received write time. This makes it possible to leave no 
intermediate result of the transaction. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0016] 

Fig. 1 is a diagram showing a general system con- 
figuration according to a first embodiment of the 
present invention; 

Fig. 2 is a view showing a format of write data man- 
aging information; 

Fig. 3 is an explanatory view showing a logical disk; 
Fig. 4 is an explanatory view showing a process of 
a secondary write data destaging unit in the case of 
executing the transfer of the write data from the pri- 
mary controller to the secondary controller; 
Fig. 5 is a diagram showing a general system con- 
figuration according to a second embodiment of the 
present invention; 

Fig. 6 is a diagram showing a general system con- 
figuration according to a third embodiment of the 
present invention; and 

Fig. 7 is a diagram showing a process to be exe- 



cuted when a master secondary controller collects 
information about the write time from the primary 
controller. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

[0017] Hereafter, the description will be oriented to 
embodiments of the present invention. 

(A) First Embodiment 

[0018] At first, the first embodiment will be described 
below. 

[0019] Fig. 1 illustrates a general configuration of the 
first embodiment. The system of the first embodiment is 
configured to have one or more processing units 100, 
one primary controller 104, one or more disk units 105 
connected to the primary controller 104, one secondary 
controller 109, and one or more disk units 105 con- 
nected to the secondary controller 109. The processing 
unit 100 is composed of a CPU 101, a main storage 
102, and a channel 103. The primary controller 104 
contains a control memory 107 and a cache memory 
108. The control memory 107 and the cache memory 
108 are non-volatilized. For enhancing the reliability, 
each memory may be dualized. The cache memory 108 
or the control memory 107 is composed of a semicon- 
ductor memory. As compared with the disk unit 105, the 
memory 1 07 or 108 provides a faster accessing capabil- 
ity by one or two digits. The primary controller 104 oper- 
ates to transfer data between the processing unit 100 
and the disk unit 105. Further, according to the present 
invention, the primary controller 104 provides a function 
of transferring data with the secondary controller 109. 
Or, the primary controller 104 contains one or more 
directors 106, each of which may operate to transfer 
data between the processing unit 200 and the disk unit 
205 and between the secondary controller 109 and the 
director 106 itself. The internal arrangement of the sec- 
ondary controller 109 is likewise to that of the primary 
controller 104. 

[0020] The write data managing information 1 13 cor- 
responding to the write data record 1 12 is created on 
the control memory 1 07. 

[0021 ] When the processing unit 1 00 issues a request 
for write 1 10 to the primary controller 104, the process- 
ing unit 1 00 operates to give a write time 111 to the write 
data record 1 12. The write time 1 1 1 represents a time 
when the request for write 110 is issued. The primary 
controller 104 and the secondary controller 109 can rec- 
ognize the sequence of the requests for write 110 
issued by the processing unit 100 by referring to this 
write time 1 1 1 . If two or more processing units 100 are 
provided, the write time 111 is defined by using the 
common clocks among the processing units 100, so 
that the primary controller 104 and the secondary con- 
troller 108 can recognize the sequence of the requests 
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for write 1 10 issued by different processing units 110. 
[0022] Fig. 2 illustrates a composition of write data 
managing information 113. Herein, in particular, the 
description will be oriented to the information directly 
concerned with the present invention. In the present 
invention, a disk to be specified when the processing 
unit 1 00 issues the request for write 1 1 0 is referred to as 
a logical disk. A logical disk ID 120 indicates a number 
of a logical disk indicated by the processing unit 100 so 
that the corresponding write data is written on the logi- 
cal disk and is contained in the request for write 110. 
This invention does not require the one-to-one corre- 
spondence between the logical disk recognized by the 
processing unit 100 and the disk unit 105 (physical 
disk). As illustrated in Fig. 3, the logical disk may be 
defined over two or more disk units 105. Further, the 
logical disk may contain redundant data and thus 
employ a RAID (Redundant Array of Inexpensive Disks) 
arrangement. A numeral 121 denotes a write address 
that is information (for example, information such as an 
area of 1 Mbyte starting from a head of the logical disk) 
for indicating an address inside of a logical disk where 
the corresponding write data is to be written and is con- 
tained in the request for write 110. A numeral 122 
denotes a write data length that represents a length of 
the corresponding write data and is contained in the 
request for write 110. These pieces of information are 
all contained in the normal request for write 110. A 
numeral 123 denotes a write data pointer that is a 
pointer to the corresponding write data 1 1 2 in the cache 
memory 108. The write time 111 has been already 
described above. The write time 111 to be given to the 
request for write 110 is a feature of the invention. A 
numeral 124 denotes a necessity bit that indicates that 
the write data is required to be transferred to the sec- 
ondary controller. (Herein, the bit 124 is referred to as a 
necessity bit 124.) This necessity bit 124 is information 
that represents transfer of the corresponding write data 

1 12 to the secondary controller 109. 

[0023] Another item of information contained in the 
control memory 107 is a secondary logical disk number 
114. This information is located to correspond to the 
logical disk of the primary controller 104 and contains 
the numbers of the secondary logical disk of the corre- 
sponding logical disk, which are paired for dual writing, 
that is, a number of the secondary controller 109 for 
loading the secondary logical disk and a logical disk 
number located in the secondary controller 109 of the 
secondary logical disk. Of course, if the logical disk 
does not have any pair for dual writing, a null value for 
indicating invalidity is put into the logical disk. 
[0024] The control memory 1 07 of the secondary con- 
troller 109 contains the write data managing information 

11 3 as well. Ttie information may have the same format 
as the write data managing information 1 13 in the pri- 
mary controller 104, in which the necessity bit 124 is 
constantly off. The control memory 107 of the second- 
ary controller 109 contains a primary logical disk 



number. The present information is located to corre- 
spond to the logical disk of the secondary controller 1 09 
and contains the numbers of the primary logical disk of 
the corresponding logical disk, which are paired for dual 

5 writing, that is, the number of the primary controller 1 04 
for loading the primary logical disk and the logical disk 
number in the primary controller 104 of the primary log- 
ical disk. Of course, if the logical disk does not contain 
any pair for dual writing, a null value for indicating inva- 

10 lidity is put into the logical disk. 

[0025] The primary controller 1 04 provides a primary 
write data receiving unit 130, which is started when it 
receives the request for write 110 from the processing 
unit 100. At first the received write data 1 12 is stored in 

15 the cache memory 108 (step 131). Next, the primary 
write data receiving unit 130 operates to secure the 
write data managing information 1 13 inside of the con- 
trol memory 108 for processing this request for write 
(step 132). Further, the information such as the write 

20 time 1 1 1 contained in the request for write is stored in 
the secured write data managing information 113 for 
setting a write data pointer 123 and the necessity bit 
1 24 (step 1 33). Finally, the completion of the request for 
write 110 is reported to the processing unit 100 (step 

25 134). The foregoing processing makes it possible to do 
a quick response because no access is required for the 
disk unit 105. The process of writing the write data 1 12 
onto the disk unit 105 is executed by the primary con- 
troller 1 04 at a later stage. This operation is executed by 

30 the normal controller. Hence, it is not described in detail. 
[0026] The primary controller 1 04 has a primary write 
data transmitting unit 140, which provides a function of 
transmitting the write data 1 12 to the secondary control- 
ler 109. At first, the write data record 1 12 at the earliest 

35 write time, included in the write data managing informa- 
tion 1 1 3 in which the necessity bit 1 24 is set, is transmit- 
ted to the secondary controller 109 having a pair for 
dual writing located therein by referring to the corre- 
sponding secondary logical disk number 114. The 

40 length of the write data 112 and the address inside of 
the secondary logical disk where the write data is to be 
written are specified by referring to the information con- 
tained in the write data managing information 113 (step 
141). Next, the primary write data transmitting unit 140 

45 waits for a report on the completion from the secondary 
controller 109 (step 142). When the report on the com- 
pletion is given back, the necessity bit 124 is set off 
(step 143). Then, the operation goes back to the step 
1 40 at which the next write data to be transmitted is tried 

50 to be found. 

[0027] The secondary controller 1 09 has a secondary 
write data receiving unit 160, which is started when the 
write data 112 is received from the primary controller 
104. The processing content of the secondary write 

55 data receiving unit 1 60 is likewise to that of the primary 
write data receiving unit 140 except that the setting of 
the write data managing information 113 does not con- 
tain a process of setting the necessity bit 124 (step 
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1 61). After the write data 112 from the primary controller 
104 is written in the cache 108. the completion of the 
write is reported to the primary controller 104. 
[0028] The secondary controller 1 09 has a secondary 
write data destaging unit 170, which provides a function 
of writing the write data 1 1 2 to the disk unit 105. At first, 
some write data records 112 included in the write data 
managing information 1 13 are determined to be written 
in the disk unit 105 in the sequence from the earliest 
write time. Then, a necessary calculation is executed to 
determine the disk unit 105 where the write data 
records are to be written and the write address. The 
method of calculation is not described in detail, because 
it is used for the normal RAID (step 171). Next, two or 
more requests for writing the write data 1 12 to the disk 
unit 105 are issued to the disk unit 105 in parallel (step 

172) . Next, the secondary write data destaging unit 170 
waits for a report on the completion of the request (step 

173) . After the reports on the completion of all the 
requests are received, the operation goes back to the 
step 171 at which the next write data 113 to be 
destaged to the next disk unit 105 is tried to be found. 
[0029] The transmitting sequence of the write data 
113 from the primary controller 104 to the secondary 
controller 109 is the sequence of the write time 111. 
Hence, the secondary controller 109 enables to gener- 
ate a state that it can hold all the write data records 113 
on or before a reference time and cannot hold ail the 
write data records 113 after the reference time. This 
operation makes it possible for the secondary controller 
109 to do a recovering process without leaving any 
intermediate result of a transaction if the primary con- 
troller 104 is broken. Further, on the secondary control- 
ler 109, the control information such as the write data 
1 12 and the write time 1 1 1 is held in a non-volatile sem- 
iconductor memory such as the cache memory 108 and 
the control memory 107. Hence, no large overhead is 
burdened on the performance. 

[0030] The aforementioned process may not provide 
sufficient performance because the transfer of the write 
data 1 12 from the primary controller 104 to the second- 
ary controller 109 is serialized. Fig. 4 illustrates the 
process executed in the case of transferring the write 
data 112 from the primary controller 104 to the second- 
ary controller 109 in parallel. In each processing unit, a 
primary write data transmitting unit a300, a primary ref- 
erence time transmitting unit 170, a secondary refer- 
ence time receiving unit 180, a secondary write data 
destaging unit a310, and a primary fail-time data scrap- 
ping unit 190 are changed from the configuration in 
which the transfer is serialized. 
[0031] Hereafter, the processing flow of the primary 
write data transmitting unit a300 will be described. At 
first, some write data records 1 12 included in the write 
data managing information 113 in which the necessity 
bit 124 is set are transferred in parallel to the secondary 
controller 109 having a pair for dual writing located 
therein by referring to the corresponding secondary log- 



ical disk number in the sequence from the earliest write 
time (step 301). Next, the primary write data transmit- 
ting unit a300 waits for a report on the completion of 
each write from the secondary controller 109 (step 302). 

5 After all the reports are given back, the necessity bit 1 24 
included in the corresponding write data managing 
information 1 13 is set off (step 303). Then, the operation 
goes back to the step 1 50 at which the next write data 
1 12 to be transmitted is tried to be found. 

w [0032] The execution of the parallel transfer of the 
write data records 1 1 2 may make each write time 1 1 1 of 
the write data records 112 be out of the sequence. 
Hence, it is necessary to recognize the reference write 
time 111 for determining the write data 112 to be 

15 destaged by the secondary controller 109. In this case, 
the write data 1 12 to be destaged is made to have the 
earlier write time 1 1 1 than the reference time corre- 
sponding to the earliest write time 1 1 1 among the write 
data included in the write data managing information 

20 113 where the necessity bit 124 is set on, which are all 
contained in the primary controller 104. This is because 
all the write data 112 having the earlier write time 1 1 1 
than the reference time are held on the side of the sec- 
ondary controller 1 09. On the other hand, the write data 

25 records 112 having the write time 1 1 1 on or before the 
reference time should not be destaged. If the primary 
controller 104 is broken, the write data records 112 have 
to be broken without being destaged. 
[0033] The primary reference time transmitting unit 

30 1 70 provides a function of transmitting the destageable 
reference time to the secondary controller 109. As men- 
tioned above, the reference time is the earliest write 
time 111 included in the write data managing informa- 
tion 1 1 3 where the necessity bit 1 24 is set on. 

35 [0034] The secondary reference time receiving unit 
180 operates to store the reference time received from 
the primary controller 104 in the control memory 108 as 
a destageable time 1 85. 

[0035] Fig. 4 illustrates a processing flow of the sec- 
40 ondary write data destaging unit a310 executed in the 
case of transmitting the write data records 112 from the 
primary controller 1 04 to the secondary controller 1 09 in 
parallel. The different respect of this flow from the 
processing flow shown in Fig. 1 is that the condition for 
45 selecting the write data 1 12 to be destaged includes a 
condition for determining if the write time 111 is on or 
before the destageable time 185 (step 31 1). 
[0036] The primary fail-time data scrapping unit 197 
provides a function of scrapping the write data 1 12 hav- 
so ing the write time 1 1 1 later than the destageable time 
185 when the primary controller 104 is broken (step 
191). 

(B) Second Embodiment 

55 

[0037] Next, the second embodiment will be described 
below. 

[0038] Fig. 5 illustrates a system configuration accord- 
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ing to a second embodiment of the present invention. 
The difference between the first and the second embod- 
iments is the number of the primary controllers 1 04. The 
system of the first embodiment is configured to have 
one primary controller 104 and one secondary control- 5 
ier 109, while the system of the second embodiment is 
configured to have two or more primary controllers 104 
and one secondary controller 209. 
[0039] The provision of two or more primary control- 
lers 1 04 makes the write time 1 1 1 of the write data 112 10 
received from one primary controller 104 shifted from 
that received from another primary controller 1 04 on the 
side of the secondary controller 109. The latest write 
time 111 of the write data 112 received from one pri- 
mary controller 104 (for example, the primary controller 15 
a) is assumed as a time a, while the latest write time 

1 1 1 of the write data 1 12 received from another primary 
controller 104 (for example, the primary controller b) is 
assumed as a time b. In this assumption, if the time a is 
earlier than the time b, the primary controller b may hold 20 
the write data 1 13 of the time later than the time a but 
earlier than the time b. As mentioned above, in order to 
erase the intermediate result of a transaction, it is nec- 
essary to guarantee all the write data records 1 13 of the 
write time 1 12 on or before a reference time and discard 25 
all the write data records 1 13 of the later write times 112 
than the reference time. Hence, the write data record 

1 1 2 of the write time 1 1 1 on or before the time a is made 
to be the write data record 1 12 to be destaged on the 
secondary controller 1 09. 30 
[0040] In correspondence with the above, the control 
memory 108 of the secondary controller 109 stores a 
write enable time 500 to the primary controller. The 
write enable time 500 is the information existing in each 
primary controller 104 and is the latest write time 111 35 
received from the corresponding primary controller 104. 

As mentioned above, therefore, of these write enable 
times 500. the write data record 112 of the write time 

1 1 1 on or before the reference time, that is, the earliest 
time corresponds to the write data record 112 to be 40 
destaged onto the secondary controller 109. 

[0041] Later, about the present embodiment, the 
description will be oriented to the content of each 
processing unit in the case of transferring the write data 

112 from one primary controller 104 to the secondary 45 
controller 109 in parallel. It goes without saying that the 
present embodiment is effective in the case of serializ- 
ing the transfer of the write data 112 from one primary 
controller 104 to the secondary controller 109. 

[0042] The processing flow of each processing unit so 
included in the primary controller 104 is likewise to the 
processing flow (the process shown in Fig. 3) executed 
in the case of transferring the write data 1 12 in parallel 
in the first embodiment. 

[0043] In turn, the description will be oriented to the 55 
processing flow of each processing unit included in the 
secondary controller 109. 

[0044] The description will be oriented to the process- 



ing flow of a secondary write data destaging unit b510. 
Herein, the description will be oriented to the difference 
between the processing flow of the secondary write 
data destaging unit 520 included in the second embodi- 
ment and the processing flow of the secondary write 
data destaging unit 170 included in the first embodi- 
ment. The content of the process executed in the sec- 
ondary write data receiving unit 510 according to the 
second embodiment is executed to check if the corre- 
sponding write time 1 1 1 is on or before all the write ena- 
ble times 500 and select the write data record 112 for 
meeting the condition (step 511). Except this process, 
the processing flow of the secondary write data destag- 
ing unit b510 included in the second embodiment is like- 
wise to the processing f bw of the secondary write data 
destaging unit 170 included in the first embodiment. 
[0045] The secondary reference time receiving unit 
b520 operates to set the reference time received from 
the primary controller 104 as a write enable time 500 of 
the primary controller 104 having transmitted the refer- 
ence time. 

[0046] In this embodiment, the write data 112 to be 
scrapped by the primary fail-time data scrapping unit 
b530 corresponds to the write data 112 that does not 
meet the condition that the corresponding write time 
1 1 1 is on or before the write enable times 500 of all the 
primary controllers (step 531). 

(C) Third Embodiment 

[0047] In turn, the description will be oriented to the 
third embodiment of the invention. 
[0048] Fig. 6 illustrates a general system configuration 
according to the third embodiment. The difference 
between the second and the third embodiments is also 
the numbers of the primary controllers 104 and the sec- 
ondary controllers 109. The system of the third embod- 
iment is configured to have two or more primary 
controllers 104 and one or more secondary controllers 
109. In this case, each pair of the primary controller 104 
and the secondary controller 109 is not required to be 
connected. 

[0049] If two or more secondary controllers 109 are 
provided, in order to erase the intermediate result of a 
transaction, it is necessary to keep the reference times 
used for selecting the write data records 112 to be 
destaged common among the secondary controllers 
109. This is because the database and the journal may 
be distributed among the secondary controllers 109. 
[0050] According to this embodiment, a master sec- 
ondary controller 700 provides a function of determining 
the reference time used for selecting the write data 
record 1 12 to be destaged. Hence, a data transfer path 
is connected between the master controller 700 and the 
other secondary controllers 109. If the data transfer 
path is failed, it is impossible to keep the reference 
times used for selecting the write data records 1 12 to be 
destaged common among the secondary controllers 
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1 09. Hence, multiplexing the data transfer path is prefer- 
able. In this embodiment the master secondary control- 
ler 700 provides a function of determining the reference 
time used for selecting the write data record 1 12 to be 
destaged. Without providing the specific secondary 
controller 109 with the function of determining the refer- 
ence time, the method for distributing the function 
among the secondary controllers 109 (for example, the 
method wherein the secondary controllers 109 alter- 
nately determine the reference time) may be employed 
for implementing the present invention. 
[0051 ] In correspondence with the above, the control 
memory 109 of the master secondary controller 700 
stores a write time 701 of the secondary controller. This 
write time 701 is the information corresponding to the 
secondary controllers 109 containing the master sec- 
ondary controller 700. The write time 701 of each sec- 
ondary controller corresponds to the information 
periodically received by the master secondary controller 
700 from each secondary controller 109 at the earliest 
time (the reference time on which the secondary con- 
troller 109 selects the secondary controller 109 in the 
second embodiment) among all the primary controller 
write enable times 500 contained in the secondary con- 
troller 109. 

[0052] In the third embodiment, the master write time 
702 is a reference time on which each secondary con- 
troller 109 selects the write data. The master write time 
702 is set by the process wherein the master secondary 
controller 700 refers to all the secondary controller write 
times 701 at a proper period and selects the earliest 
time. All the write data records 112 having the write 
times 1 1 1 on or before the selected time are contained 
in the secondary controller 109. Hence, by guarantee- 
ing the write data 112 that meet this condition and 
scrapping all the write data 112 that do not meet it, it is 
possible to erase the intermediate result of the transac- 
tion. 

[0053] Hereafter, also in this embodiment, the 
description will be oriented to the content of each 
processing unit in the case of transferring the write data 
112 from one primary controller 104 to the secondary 
controllers 109 in parallel. It goes without saying that 
this embodiment is effective if the transfer of the write 
data 1 12 from one primary controller 104 to the second- 
ary controllers 109 is serialized. 
[0054] The processing flow of each processing unit 
included in the primary controller 104 is substantially 
similar to that of the second embodiment. It goes with- 
out saying that the primary reference time transmitting 
unit 170 provides a function of transmitting a reference 
time to each secondary controller 109 for transmitting 
the write data 1 12. The reference time to be transmitted 
corresponds to the earliest write time 1 1 1 contained in 
the write data managing information 113 in which the 
necessity bit 124 of each write data record 112 corre- 
sponding to the secondary controller 109 for transmit- 
ting the reference time is set on. 



[0055] The different respect of the processing flow of 
the third embodiment from that of the second embodi- 
ment is that the secondary controller 109 contains a 
secondary write time transmitting unit 710, a secondary 

5 write data destaging unit c720, and a primary fail-time 
data scrapping point c, and the master secondary con- 
troller 700 contains a master secondary write time 
receiving unit 711, a master write time calculating unit 
712, and a master secondary write time transmitting 

10 unit 713. 

[0056] The secondary write time transmitting unit 71 0 
operates to transmit the earliest time 180 included in all 
the primary controller write enable times 500 stored in 
the primary controller 1 09 to the master secondary write 

75 time receiving unit 71 1 included in the master second- 
ary controller 700 at a proper period. The secondary 
write time transmitting unit 710 included in the second- 
ary controller 109 except the master secondary control- 
ler 700 makes use of a data transfer path between the 

20 secondary controllers 109. The secondary write time 
transmitting unit 710 of the master secondary controller 
700 makes use of communicating means provided in 
the master secondary controller 700. 
[0057] The master secondary write time receiving unit 

25 711 operates to set the time received from the second- 
ary write time transmitting unit 710 to the secondary 
controller write time 701 corresponding to the second- 
ary controller 109 having transmitted the time. 
[0058] The master write time calculating unit 712 

30 operates to refer to all the secondary controller write 
times 701 , select the earliest time, and then set the time 
as the master write time 702. 

[0059] The master secondary write time transmitting 
unit 713 operates to transmit the time set as the master 

35 write time 702 at a proper period in response to the 
requests issued from the secondary write data destag- 
ing unit 720 and the primary fail-time data scrapping unit 
730 included in each secondary controller 109. The 
secondary controller 109 rather than the master sec- 

40 ondary controller 700 makes use of the data transfer 
path between the secondary controllers 109. For the 
request issued from the master secondary controller 
700, the master secondary write time transmitting unit 
713 makes use of communicating means provided in 

45 the master secondary controller 700. 

[0060] The different respect of the secondary write 
data destaging unit 720 from that of the second embod- 
iment is that when selecting the write data 112 to be 
destaged, the secondary write data destaging unit 720 

so operates to receive the reference time from the master 
secondary write time transmitting unit 713 and select 
the write data 112 with the earlier write time 1 1 1 than 
the reference time as the write data to be destaged 
(step 721). 

55 [0061] The different respect of the primary fail-time 
data scrapping unit c730 from that of the second 
embodiment is that when selecting the write data 1 1 2 to 
be scrapped from the cache memory 107, the primary 
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fail-time data scrapping unit c730 operates to receive 
the reference time from the master secondary write time 
transmitting unit 713 and select the write data 112 
except ail the write data 112 with the earlier write time 
111 than the reference time as the write data to be 5 
scrapped (step 731). 

[0062] According to this embodiment, the master con- 
troller 700 is arranged to receive the information 
required for calculating the reference time from the sec- 
ondary controller 109. In place, it may be arranged to 10 
receive the information from the primary controller 104 
as shown in Fig. 7. In this case, the control memory 108 
of the master secondary controller 700 stores a master 
primary controller write time 800. The write time 800 
corresponds to the primary controller 104. The write 15 
time 800 is set as follows. The master secondary con- 
troller 700 receives the earliest write time 111 included 
in ail the write data managing information 113 contain- 
ing the necessity bits 124 being set at a proper period 
and sets it as the write time 800 inside of the primary 20 
controller 1 09. For the master write time 702, the master 
secondary controller 700 operates to refer to all the pri- 
mary controller write times 701, select the earliest time 
and set it as the master write time 702 at a proper 
period. Like the foregoing embodiments, the reference 25 
time for the destaging or the data scrapping is the mas- 
ter write time 702. 

Claims 

30 

1 . A computer system comprising: 

a processing unit (100); 
a first controller (104) connected to said 
processing unit and having a first cache mem- 35 
ory (108); 

a first disk unit (105) connected to said first 
controller; 

a second controller (109) connected to said 
first controller and having a second cache 40 
memory (108); 

a second disk (105) connected to said second 

controller; and 

wherein 

said processing unit issues a request for write 45 
containing write data and a write time to said 
first controller, 

said first controller (a) stores said write data in 
said first cache memory, (b) reports completion 
of said request for write to said processing unit, so 
(c) stores said write data stored in said first 
cache memory in said first disk unit, and (d) 
transmits said write data and said write time 
stored in said first cache memory to said sec- 
ond controller in the sequence of said write 55 
time, wherein said steps (c) and (d) are exe- 
cuted in parallel, and 

said second controller (e) stores said write data 



and write time transmitted from said first con- 
troller to said second cache memory and (f) 
stores said write data stored in said second 
cache memory to said second disk unit in the 
sequence of said write time. 

2. The computer system according to claim 1 , wherein 
after said step (e) is executed, said second control- 
ler transmits a report on completion to said first 
controller. 

3. The computer system according to claim 1 , wherein 
said first and second cache memories are non-vol- 
atile. 

4. Tlie computer system according to claim 1, further 
comprising: 

another processing unit; and 

wherein both of said processing units have 

common clocks to each other. 

5. The computer system, comprising: 

a processing unit (100); 
a first controller (104) connected to said 
processing unit and having a first cache mem- 
ory (108); 

a first disk unit (105) connected to said first 
controller; 

a second controller (109) connected to said 
first controller and having a second cache 
memory (108); 

a second disk unit (105) connected to said sec- 
ond controller; and 
wherein 

said processing unit issues a request for write 
containing write data and a write time to said 
first controller, 

said first controller (a) stores said write data in 
said first cache memory, (b) reports completion 
of said request for write to said processing unit, 
(c) stores said write data stored in said first 
cache memory to said first disk unit, (d) trans- 
mits plural pairs of said write data and said 
write time and a destageable time stored in 
said first cache memory to said second control- 
ler in the earlier sequence of said write times, 
wherein said steps (c) and (d) are executed in 
parallel, and 

said second controller (e) stores said write data 
arid said write time transmitted from said first 
controller in said second cache memory, and (f) 
stores in said second disk unit said write data 
corresponding to said earlier write time than 
said destageable time and stored in said sec- 
ond cache memory. 
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6. The computer system accorcfing to claim 5, wherein 
after said step (e) is executed, said second control- 
ler transmits a report on the completion to said first 
controller. 

5 

7. The computer system according to claim 5, wherein 
said first and second cache memories are non-vol- 
atile. 

8. The computer system according to claim 5, further 10 
comprising: 

another processing unit; and 

wherein both of said processing units have 

common clocks. 15 
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